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Abstract 

Given an input graph G and an integer fc, the fc-PATH problem asks whether there exists a path 
of length k in G. The counting version of the problem, #fc-PATH asks to find the number of paths 
of length k in G. Recently, there has been a lot of work on finding and counting fc-sized paths in 
an input graph. The current fastest (randomized) algorithm for fc-PATH has been given by Williams 
and it runs in time O* (2 ) [IPL, 2009]. The randomized algorithm for finding a fc-path in the input 
graph was recently generalized by Koutis and Williams for testing whether there exists a subgraph 
in the input graph which is isomorphic to a given fc-vertex tree [ICALP, 2009]. Bjorklund, Husfeldt, 
Kaski, and Koivisto [ESA, 2009] gave a deterministic algorithm for #fc-PATH running in time and 
space C*(( fe " 2 )) on an m P ut 8 ra P n with n vertices and gave a polynomial space algorithm running 
in time 0*(3 fc / 2 ( fe ™ 2 )). 

In this paper we study a natural generalization of both fc-PATH and fc-TREE problems, namely, 
the Subgraph Isomorphism problem. In the Subgraph Isomorphism problem we are given 
two graphs F and G on k and n vertices respectively as an input, and the question is whether there 
exists a subgraph of G isomorphic to F. We show that if the treewidth of F is at most t, then there 
is a randomized algorithm for the SUBGRAPH ISOMORPHISM problem running in time 0*(2 k n 2t ). 
To do so, we associate a new multivariate Homomorphism polynomial of degree at most fc with 
the Subgraph Isomorphism problem and construct an arithmetic circuit of size at most 
for this polynomial. Using this polynomial, we also give a deterministic algorithm to count the 
number of homomorphisms from F to G that takes time and uses polynomial space. For 

the counting version of the SUBGRAPH ISOMORPHISM problem, where the objective is to count 
the number of distinct subgraphs of G that are isomorphic to F, we give a deterministic algorithm 
running in time and space 0*((J/ 2 ) n2p ) or (/ £ ™ 2 ) nC '^ loS '^ '■ ^ e a ^ so S' ve an algorithm running in 
time 0*(2 fe ( fc " 2 )n 5p ) and taking space polynomial in n. Here p and t denote the pathwidth and 
the treewidth of F, respectively. Thus our work not only improves on known results on SUBGRAPH 
Isomorphism but it also extends and generalize most of the known results on fc-PATH and fc-TREE. 



1 Introduction 

In this paper we consider the classical problem of finding and counting a fixed pattern graph F on k 
vertices in an n-vertex host graph G, when we restrict the treewidth of the pattern graph F by t. More 
precisely the problems we consider are the SUBGRAPH ISOMORPHISM problem and the #SUBGRAPH 
Isomorphism problem. In the Subgraph Isomorphism problem we are given two graphs F and 
G on k and n vertices respectively as an input, and the question is whether there exists a subgraph in 
G which is isomorphic to F. In the #SUBGRAPH ISOMORPHISM problem the objective is to count the 
number of distinct subgraphs of G that are isomorphic to F. Recently #SUBGRAPH ISOMORPHISM, in 
particular when F has bounded treewidth, has found applications in the study of biomolecular networks. 
We refer to Alon et al. and references there in for further details. 
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In a seminal paper Alon et al. @ introduced the method of COLOR-CODING for the SUBGRAPH 
ISOMORPHISM problem, when the treewidth of the pattern graph is bounded by t and obtained random- 
ized as well as deterministic algorithms running in time 2°^n°^ . This algorithm was derandomized 
using /c -perfect hash families. In particular, Alon et al. [4] gave a randomized 0*(5.4 fc fl time algorithm 
and a deterministic O*{o k ) time algorithm, where c a large constant, for the /c-Path problem, a special 
case of Subgraph Isomorphism where F is a path of length k. Using this algorithm for /c-Path, 
Alon et al [4] also resolved a conjecture of Papadimitriou and Yannakakis [22] that for k = O(logn), 
the /c-Path problem can be solved in polynomial time. There has been a lot of efforts in parameterized 
algorithms to reduce the base of the exponent of both deterministic as well as the randomized algorithms 
for the /c-PATH problem. In the first of such attempts, Chen et al. ifTTTl and Kneis at al. [ 19 ] independently 
discovered the method of Divide and Color and gave a randomized algorithm for /c-Path running in 
time 0*(A k ). Chen et al. O also gave a deterministic algorithm running in time O* (A k+0 ( k ">) using an 
application of universal sets. While the best known deterministic algorithm for /c-Path problem still runs 
in time 0*(A k+0 ^), the base of the exponent of the randomized algorithm for the /c-PATH problem has 
seen a drastic improvement. Koutis [20] introduced an algebraic approach based on group algebras for 
/c-PATH and gave a novel randomized algorithm running in time 0*(2 3fc / 2 ) = 0*(2.83 fc ). Williams 11241 
augmented the approach of Koutis l20l with more random choices and several other ideas and gave the 
current fastest algorithm for /c-Path running in time 0*(2 k ). The best known algorithms for finding a 
Hamilton path, case k = n for the /c-Path problem, in an n-vertex graph run 0*(2 n ) time and are 
quite old j6l[T4l[T5]]. Any significant improvement in the run time dependence on k given by Williams' 
algorithm would imply a faster HAMILTON PATH algorithm, which has been an open problem for over 
forty years. 

While there has been a lot of work on the /c-Path problem, there has been almost no progress 
on other cases of the SUBGRAPH ISOMORPHISM problem until this year. Amini et al. [5] introduced 
an inclusion-exclusion based approach in the classical COLOR-CODING and using it gave a random- 
ized 5A k n°^ time algorithm and a deterministic 5A k+ °^n°^ time algorithm for the SUBGRAPH 
ISOMORPHISM problem, when F has ueewidth at most t. Koutis and Williams [21] generalized their 
algebraic approach for /c-PATH to /c-Tree, a special case of SUBGRAPH ISOMORPHISM problem where 
F is a tree on A;- vertices, and obtained a randomized algorithm running in time 0*(2 k ) for /c-Tree. 
Our first result fills this gap and generalizes the results of Koutis and Williams [21] and Williams |[24ft 
for /c-Tree and /c-PATH respectively, to the case when the pattern graph F has treewidth at most t. 
More precisely, we give a randomized algorithm for the SUBGRAPH ISOMORPHISM problem running in 
time 0*(2 k (nt) t ), when the treewidth of F is at most t. In general our approach follows the road map 
suggested by Koutis and Williams IlTTTI and Williams l24l . which is based on reducing the problem to 
checking a multilinear term in a specific polynomial of degree at most k. Our first non-trivial contribu- 
tion is a new polynomial of degree at most k, namely the Homomorphism polynomial, relating graph 
homomorphisms and injective graph homomorphisms for testing whether a graph contains a subgraph 
which is isomorphic to a fixed graph F. We show that if the treewidth of the pattern graph F is bounded 
by t then we can make an arithmetic circuit of size for the Homomorphism polynomial which 

combined with a result of Williams [24] yields our first theorem. In fact, what we have is an arithmetic 
formula and using this we give a deterministic algorithm to count homomorphisms from F to G which 
runs in time 0*((nt) 2t ) and takes space polynomial in n, when the treewidth of F is t. This is not only 
crucial for our polynomial space algorithm for counting subgraphs but also substantially improves the 
space requirement, from k t+l log n to polynomial in n and k, of the previous algorithm for counting 
graph homomorphisms of Diaz et al. lfl2l . 

In the second part of the paper we consider the problem of counting the number of pattern subgraphs, 
that is, the #SUBGRAPH ISOMORPHISM problem. The algorithm given in (HdUCEIl for finding a hamil- 
tonian path can in fact count the number of hamiltonian paths in the input graph in time O* (2 n ). Hence, 
a natural question is whether we can solve the #SUBGRAPH ISOMORPHISM problem in 0*(c k ) time, 



'Throughout this paper O* () notation hides factors polynomial in the instance size n and the parameter k. 
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when the fc-vertex graph F is of bounded treewidth or whether we can even solve the #/c-PATH problem 
in 0*(c k ) time? Flum and Grohe US showed that the #fc-PATH problem is #W[l]-hard and hence it 
is very unlikely that the #A;-PATH problem can be solved in time f(k)n°^ where / is any arbitrary 
function of k. In another negative result, Alon and Gutner [31 have shown that one can not hope to solve 
#fc-PATH better than 0{n k l 2 ) using the method of COLOR-CODING. They show this by proving that any 
family T of "balanced hash functions" from {1, . . . , n} to {1, . . . , k}, must have size VL{n k / 2 ). On the 
positive side, very recently Vassilevska and Williams ll23l studied various counting problems and among 
other results gave an algorithm for the #fc-PATH problem running in time 0*{2 k (k/2)\ ( fc ™ 2 ) ) an< ^ s P ace 
polynomial in n. Bjorklund et al. (8] introduced the method of "meet-in-the-middle" and gave an algo- 
rithm for the #fc-PATH problem running in time and space 0*(( yk r J 2 )). They also gave an algorithm for 

#fc-PATH problem running in time O* (3 fc / 2 ( fe %) ) and polynomial space, improving on the polynomial 
space algorithm given in J23J. We extend these results to the #SUBGRAPH ISOMORPHISM problem, 
when the pattern graph F is of bounded treewidth or pathwidth. And here also graph homomorphisms 
come into play. By making use of graph homomorphisms we succeed to extend the applicability of the 
meet-in-the-middle method to much more general structures than paths. Combined with other tools — 
inclusion-exclusion, the DISJOINT Sum problem, separation property of graph of bounded treewidth or 
pathwidth and the trimmed variant of Yate's algorithm presented in Q — we obtain the following results. 
Let F be a k- vertex graph and G be an n- vertex graph of pathwidth p and treewidth t. Then #Sub GRAPH 
ISOMORPHISM is solvable in times 0*{( k n /2 )n 2 P) and ( fc / 2 )n°( tIogfc ) and space C7*(( fc %) ). We also 
give an algorithm for #SUBGRAPH ISOMORPHISM that runs in time C*(2 fe ( fc " 2 )n 3p t 2 ') (respectively 

2 k ( k n /2 ) n 0(*i°s*O) and takes polynomial space. Thus our work not only improves on known results on 
Subgraph Isomorphism of Alon et al. (4| and Amini et al. Q but it also extends and generalize 
most of the known results on /c-Path and &-TREE of Bjorklund et al. (H, Koutis and Williams l2ll and 
Williams l24ll . 

The main theme of both algorithms, for finding and for counting a fixed pattern graph F, is to 
use graph homomorphisms as the main tool. Counting homomorphisms between graphs has found ap- 
plications in variety of areas, including extremal graph theory, properties of graph products, partition 
functions in statistical physics and property testing of large graphs. We refer to the excellent survey of 
Borgs et al. (9] for more references on counting homomorphisms. In Q, for the first time, it was used 
to design exact and parameterized algorithms. One of the main advantages of using graph homomor- 
phisms is that in spite of their expressive power, graph homomorphisms between many structures can 
be counted efficiently. Secondly, it allows us to generalize various algorithm for counting subgraphs 
with an ease. We also combine counting homomorphisms with the recent advancements on computing 
different transformations efficiently on subset lattice. Our deterministic polynomial space algorithm for 
counting graph homomorphisms uses arithmetic formula and it appears that this method could be useful 
in designing polynomial space variant of other exact algorithms. 

2 Preliminaries 

Let G be a simple undirected graph without self loops and multiple edges. We denote the vertex set of 
G by V(G) and the set of edges by E(G). For a subset W C V(G), by G[W] we mean the subgraph of 
G induced by W. We refer to Appendix [6j] for the standard definitions of treewidth, pathwidth and nice 
tree decomposition. 

Graph Homomorphisms: Given two graphs F and G, a graph homomorphism from F to G is a map 
/ from V(F) to V(G), that is / : V(F) V(G), such that if uv G E{F), then f(u)f(v) G E(G). 
Furthermore, when the map / is injective, / is called an injective homomorphism. Given two graphs F 
and G, the problem of SUBGRAPH ISOMORPHISM asks whether there exists an injective homomorphism 
from F to G. By hom(F, G), inj(F, G) and sub(F, G) we denote the number of homomorphisms from 
F to G, the number of injective homomorphisms from F to G and the number of distinct copies of F in 
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G, respectively. We denote by aut(F, F) the number of automorphisms from F to itself, that is bijective 
homomorphisms. The set HOM(F, G) denotes the set of homomorphisms from F to G. 

Functions on the Subset Lattice: For two functions /i : D\ — > R\ and /2 : D2 — > R% such that for 
every x € D\ n D2, fi (x) = f'2 (x) we define the gluing operation /1 © /2 to be a function from D\ U Z?2 
to i?i U R2 such that /1 © f2(x) = fi(x) if x £ D\ and /1 © /2(x) = f2(x) otherwise. 

For a universe U of size n, we consider functions from 2 U (the family of all subsets of U) to Z. 
For such a function / : 2^ — > Z, the zeta transform of / is a function /( : 2 17 -» Z such that 
fC(S) = X^xcs f {■%■)■ Given /, computing /£ using this equation in a naive manner takes time 0*(3 n ). 
However, one can do better, and compute the zeta transform in time 0*(2 n ) using a classical algorithm 
of Yates 11251 . In this paper we will use a "trimmed" variant of Yates's algorithm [7] that works well 
when the non-zero entries of / all are located at the bottom of the subset lattice. In particular, it was 
shown in Q that if f(X) only can be non-zero when \X\ < k then /£ can be computed from / in time 
0*(Yli=i ("))■ m our algorithm we will also use an efficient algorithm for the DISJOINT Sum problem, 
defined as follows. Input is two families A and B of subsets of U and two weight functions a : A —> Z 
and (3 : B — ► Z. The objective is to calculate 



Following an algorithm of Kennes fl6l . Bjorkund et al. HI gave an algorithm to compute AM Bin time 
G(n(\ lA\ + \i B\)), where \ A = {X : 3A G A, X C A} is the down-closure of A. 
Arithmetic Circuits and Formula: An arithmetic circuit (or a straight line program) C over a specified 
ring K is a directed acyclic graph with nodes labeled from {+, x} U {x\, . . . , x n } U K, where X = 
{xi, . . . , x n } are the input variables of C. Nodes with zero out-degree are called output nodes and those 
with labels from X U IK are called input nodes. The Size of an arithmetic circuit is the number of gates 
in it. The Depth of C is the length of the longest path between an output node and an input node. A 
formula is an arithmetic circuit where every node has out-degree bounded by 1, that is, the underlying 
undirected graph is a tree. The nodes in C are sometimes referred to as gates. It is not hard to see that 
with every output gate g of the circuit C we can associate a polynomial / € K[x\, . . . , x n ]. For more 
details on arithmetic circuits see iflQl fTll. 

A polynomial / £ K[xi, . . . , x n ] is said to have a multilinear term if there is a term of the form 
cs Yiies x i w i tn c s/f an d / 5* C {1, . . . , n} in the standard monomial expansion of /. 

3 Algorithm for Finding a Subgraph 

In this section we give our first result and show that the Subgraph Isomorphism problem can be 
solved in time 0*(2 k (nt) t ) when the pattern graph F has treewidth at most t. The main idea of our 
algorithm follows that of Koutis and Williams ll2T1l and Williams ll24l for the /c-Tree problem and the 
/c-Path problem, respectively. However, we need additional ideas for our generalizations. Our second 
result of this section is a polynomial space algorithm to count graph homomorphisms between F and G 
when the treewidth of F is at most t. 

First, given two graphs F and G, we will associate a polynomial Vg(X) where X = {x v \ v € 
V(G)} such that: (a) the degree of Vg{X) is k; (b) there is a one to one correspondence between the 
monomials of Vg and homomorphisms between F and G; and (c) Vg contains a multilinear monomial 
of degree k if and only if G contains a subgraph isomorphic to F. The polynomial we associate with F 
and G to solve the SUBGRAPH ISOMORPHISM problem is given by the following. 




Homomorphism Polynomial = Vg(xi, . . . , x. 





*eHOM(F, G) uaV{F) 



We first show that Vg is "efficiently" computable by an arithmetic circuit. 
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Lemma 1. Let F and G be given two graphs with \V{F)\ = k and \ V(G)\ = n. Then the polynomial 
V G (xi, ■ ■ ■ , x n ) is computable by an arithmetic circuit of size 0*((ni)*) where t is the tree-width of F. 

Proof. Let F, G, k, n and t be as given in the lemma. Let D = (U, T, r) be a nice tree decomposition of 
F rooted at r. We define a polynomial f G (T, r, U T , S, ip) G Z[X], where 

• r is a node in T; 

• U T C V(F) is the vertex subset associated with r; 

• S be a multi-set (an element can repeat itself) of size at most t + 1 with elements from the set 

V(G); 

• tp : F[U T ] — > G[S] is a multiplicity respecting homomorphism between the subgraphs induced by 
U T and S respectively; and 

• X = {x v \v G V(G)} is the set of variables. 

Let V T denote the union of vertices contained in the bags corresponding to the nodes of subtree of T 
rooted at r. At an intuitive level f G (T, r, U T , S, ip) represents the polynomial which contains sum of 
monomials of the form FLeViAt/T x <t>(u)y where is a homomorphism between F[V T ] and G consistent 
with ip, that is, (j> is an extension of ip to F[y r ]. Formally, the polynomial fa can be defined inductively 
by going over the tree T bottom up as follows. 

Case 1 (base case): The node r is a leaf node in T. Since V T = U T , there is only one homomorphism 
between F[V T ] and G that is an extension of ip, hence fc{T, t,U t ,S,i/j) = 1. 

Case 2: The node r is a join node. Let n and T2 be the two children of r and T\ and T2 denote the sub- 
trees rooted at t\ and T2 respectively. Note that U T = U Tl = U T2 and (V Tl (~]V T2 )\U T = 0. Hence, 
any extension of ip to a homomorphism between i^V^J and G is independent of an extension of 
tp to a homomorphism between i^V^J and G. Thus we have, 

f G (T,T,U T ,S,^) = f G (T 1 ,T 1 ,U Tl ,S,^)f G (T 2 ,^,S,U n ,^ (1) 

Case 3: The node r is an introduce node in T, let n be the only child of r, and {u} = U T \ U T1 . Also, 
let Ti denote the sub-tree of T rooted at t\ . In this case any extension of tp to a homomorphism 
between F[V T ] and G is in fact an extension of ip\u T1 and thus we get 

f G (T, r, U T , S, = f G (T u n, U T1 ,S\ Mu)}, ^\ Uti ). (2) 

Case 4: The node r is a forget node in T, and n is the only child of r in T. Now, U T1 contains an 
extra vertex along with U T . Thus any extension of i/j to a homomorphism between F[V T ] and 
G is a direct sum of an extension of ip to include u and that of V T1 , where {it} = U T1 \ U T . 

Define, F = |v | u G V(G),Vio G ?7 T ,u;u G ^ G £?(G)}. For u G F, let 

^ : U T1 — > 5 U {«} be such that ^„|[/ T = V' an d ^(^) = u - Then, 

[0 otherwise. 

Let HOM(U r , G) denote the set of all homomorphisms between the subgraph of F induced by U r 
and G. In order to consider all homomorphisms between F and G, we run through all homomorphisms 
ijj between F[U r ] and G, and then compute f G (T,r,U r , Image(ip),ip) multiplied by the monomial 
corresponding to ip. Now we define, 

H G (T,r,U r )= f G (T,r,U r ,S,p,iP)( J] x v ) (4) 

i>e¥lOM(Ur,G) ueu r ,v=tp(u) 
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where, we consider the set S^p = Image (-0) as a multi set. Now we need to show that Hq is efficiently 
computable and Vg = *Hg- We first show that TLq is computable by an arithmetic circuit of size 
0*((ntf). 

Claim 1. Hg(T, r, U r ) is a polynomial of degree k and is computable by an arithmetic circuit of size 
0*((n£)*). Here r is the root of the tree T. 

Proof. In the above definition of Jg, the only place where the degree of the polynomial increases is at 
forget nodes of T. The number of forget nodes in T is exactly k — \U r \. Thus the degree of any Jg is 
k — \U r \ and hence the degree of Hg is k. 

From the definitions in Equations £T||4]) above, Hg(T, r, U r ) can be viewed as an arithmetic circuit 
C with X = {x v \v G V(G)} as variables and gates from the set {+, x }. Any node of C is labeled either 
by a variables from U or a function of the form Jg(T, t, U t , S, ip). The size of the circuit is bounded by 
the number of possible labelings of the form Jg{T, t, U t , S, ip), where T and U T are fixed. But this is 
bounded by \V(T)\ ■ n t+1 • (t + = (nt) t+ °^ = 0*{(nt) 1 ). □ 

Next we show that TCg defined above is precisely Vg an d satisfies all the desired properties. 

Claim 2. Let 4> '■ V(F) — > V(G). Then <p G HOM(F,G) if and only if the monomial \\ u& y^ nas 
a non-zero coefficient in TIq(T, r, U r ). In other words, we have that 

H G (T,r, U r ) = V G (xi,...,x n ) = ^2 ] [ X^ u y 

0eHOM(F,G) ueV(F) 

Proof. We first give the forward direction of the proof. Let 4> €HOM(F, G) and ip = 4>\u r - We show 
an expansion of Hg{T, r, U r ) which contains the monomial n«Gy(F) x <t>{u)- We first choose the term 
/g(T, r, U r ,Sjf,,ijj) x Ilue^r x i>{u)- We expand /g{T, r, U r ,S^p, ip) further according to the tree struc- 
ture of T. We describe this in a generic way. Consider the expansion of fc{T', r, U T , S, \)- If T is a join 
node we recursively expand both the sub polynomials according to Equation (Q]). When r is an introduce 
node we use Equation Q. In the case when r is a forget node, we first note that Y ^ (this is the same 
Y as defined in Case 4) and also that 4>{u) G Y, where u € U T \ U T1 . The last assertion follows from the 
definition of Y. Here, we choose the term which contains x^r u \ , note that there exists exactly one such 
term and proceed recursively. 

Let M denote the monomial obtained by the above mentioned expansion. For any node v € V(G), 
we have deg M (x v ) = )|, where deg M (x v ) denotes the degree of the variable x v in the monomial 

M . To see this, in the tree decomposition D, a node u € V(F) enters the tree through a unique forget 
node and this is exactly where the variable x^ u ^ is multiplied. Thus we have M = n«gy(F) x 4>{u)- 
Note that this expansion is uniquely defined for a given (j). 

For the reverse direction, consider an expansion p of H.G(T,r,U r ) into monomials and let M = 
Y\ Xy v be a monomial of p, where Y^d v = k. We build a 4> G HOM(F,G) using p and the structure of T. 
Let fc{T, r, U r , S^jip) be the first term chosen using Equation ((U). For every u € U r let <j>{u) = ip( u )- 
Inductively suppose that we are at a node r and let T 1 be the corresponding subtree of T. In the case of 
Equations (OQ) and (0) there is no need to do anything. In the case of Equation (f3]), where r is a forget 
node, with u G U Tl \ U T . If the expansion p chooses the term /g(Ti,ti, U Tl , S U {v }, ip v ) x x v , then 
we set <f)(u) = v. 

It remains to show that the map cj> : V(F) — > V{G) as built above is indeed a homomorphism. We 
prove this by showing that for any edge uv! G E(F) we have that cj)(u)(j)(u') G E[G). If uvl is an edge 
such that both u, u' G U r then we are done, as by definition <j>\u r G HOM(U r , G) and thus 4> preserves 
all the edges between the vertices from U r . So we assume that at least one of the end points of the edge 
uu' is not in U r . By the property of tree decomposition there is a r' G T such that {u, u'} G U T '. Now 
since at least one of the endpoints of uu' is not in U r , there is a node on the path between r and r' such 
that either u or u' is forgotten. Let t" be the first node on the path starting from r' to r in the tree T such 
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that it does not contain both u and u'. Without loss of generality let u £ U T n and thus t" is a forget node 
which forgets u. At any forget node, since the target node v is from the set Y, we have that preserves 
the edge relationships among the vertices in U T n and u. Now from Equation (O, the property of Y and 
the fact that u' £ U T » we have that <f>(u)(f>(u') € E(G). □ 

Now by setting Vg{X) = Hg(T, r, U r ) the lemma follows which concludes the proof. □ 

We also need the following proposition proved by Williams [24], which tests if a polynomial of 
degree k has a multilinear monomial with non-zero coefficient in time 0(2 k s(n)) where s(n) is the size 
of the arithmetic circuit. 

Proposition 1 ( 11241 ). Let P(x±, . . . , x n ) be a polynomial of degree at most k, represented by an arith- 
metic circuit of size s(n) with + gates (of unbounded fan-in), x gates (of fan-in two), and no scalar 
multiplications. There is a randomized algorithm that on every P runs in 0(2 k s(n)n ^) time, outputs 
"yes" with high probability if there is a multilinear term in the sum-product expansion of P, and always 
outputs "no" if there is no multilinear term. 

Lemma [T]and Proposition Q] together yield our first theorem. 

Theorem 1. Let F and G be two graphs on k and n vertices respectively and tw(F) < t. Then, there 
is a randomized algorithm for the SUBGRAPH ISOMORPHISM problem that runs in time 0*(2 k (nt) t ). 

Counting Homomorphisms: We note that the polynomial Vg can De used to count the number of 
homomorphisms from F to G. Let hom(F, G) denote the number of homomorphisms from F to G. 
Then we have, hom(F, G) = Vg(1, •••;!)• First we make the following observations: 

• Hg can in fact be computed by an arithmetic formula of size n ^\ In fact the straightforward 
circuit construction described above gives a formula. 

• The depth of is bounded by 1 + depth of T and the depth of T is bounded by 0(k). 

We need the following proposition. 

Proposition 2 (Folklore). [★] H Given an arithmetic formula of depth d and size s, $ can be evaluated 
at (1, ... , 1) in time O(s) and 0(d{d + log s)) bits of space. 

A naive implementation of the above procedure would require + 0(d(d + logs)) bits of 

space, as we may need to store the whole formula in the memory for an evaluation. Here, we give 
an implementation which reduces the space requirement to C(n(log k + tlogn)) bits. The idea is not 
to store the entire formula, instead to have a space efficient algorithm that given (u,i,T) as an input, 
outputs i th child of a node u in <J>. 

Lemma 2. [*] Given a nice tree decomposition of F, and a label (T, r, U, S, ip) of a node u in the for- 
mula $ the following can be computed in 0{(n+k)(\og k+t log n)) bits of space and time 0{Size(§)): 
(a) the number of children of u in and (b) the label of the i th child of u in 

We can implement the algorithm given in Proposition [2]by using the algorithm of Lemma[2] whenever 
we need to access an edge of the arithmetic formula <£. Note that even though every call to the algorithm 
of Lemma[2]requires 0((n + k) (log k + t log re)) bits of space, this can be reused among different calls. 
Thus the total space requirement is 0(k(k + t log re) + n(log k + tlogn)). Also, the overall running 
time of the evaluation procedure for $ will be 0*((ni)* • Size(^)) = 0*((nt) 2t ). Hence we have the 
following: 

2 Proofs of results labeled with [*] have been moved to the appendix due to space restrictions. 
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Theorem 2. Let F and G be two graphs on k and n vertices respectively and tw(F) < t. Then, 
the number of homomorphisms from F to G, homfi* 1 , G), can be computed in time 0*((nt) 2t ) and 
0(k 2 + (k + n)t log ra ) bits of space. 

This is a substantial improvement on space compared to the 0{k t+1 logra) bound of lfl2l . In fact, 
our algorithm takes polynomial number of bits, irrespective of the treewidth t. However the running 
time of our algorithm is 0*{{nt) 2t ) compared to the 0*(k t ) time bound of lfT2l . 

4 Algorithms for Counting Subgraphs 

In this section, we give algorithms for the #SUBGRAPH ISOMORPHISM problem, when F has either 
bounded treewidth or pathwidth. 

4.1 Counting Subgraphs with Meet in The Middle 

When 1^(^)1 = k, the pathwidth of F is p and |^(G)| = n, then the running time of our algorithm 
for #Subgraph Isomorphism is (9(( fc %)n 2p+c> ( 1 )). Roughly speaking, our algorithm decomposes 
V(F) into three parts, the left part L, the right part R, and the separator S. Then the algorithm guesses 
the position of S in G, and for each such position counts the number of ways to map L and R into G, 
such that the mappings can be glued together at 5. Thus our result is a generalization of the meet in 
the middle algorithm for #/c-Path in an n-vertex graph by Bjorklund et al. (H. However, our algorithm 
differs from that of Bjorklund et al. (H conceptually in two important points. First, we count the number 
of injective homomorphisms from F to G instead of counting the number of subgraphs of G that are 
isomorphic to F. To get the number of subgraphs of G that are isomorphic to F we simply divide the 
number of injective homomoiphisms from F to G by the number of automorphisms of F. The second 
difference is that we give an algorithm that given a fc-vertex graph F of pathwidth p and an n-vertex 
graph G computes in time 0*((^)n p ) the number of injective homomorphisms from F to G[S] for 
every fc-vertex subset S of G. In the #A;-PATH algorithm of Bjorklund et al. [8], a simple dynamic 
programming algorithm to count k -paths in G[S] for every A;- vertex subset S, running in time 0* (()!)) 
is presented, however this algorithm does not seem to generalize to more complicated pattern graphs 
F. Interestingly, our algorithm to compute the number of injective homomorphisms from F to G[S] for 
every S is instead based on inclusion-exclusion and the trimmed variant of Yates's algorithm presented 
in Q. In order to implement the meet-in-the-middle approach, we will use the following fact about 
graphs of bounded pathwidth. 

Proposition 3 (Folklore). [*] Let F be a k-vertex graph of pathwidth p. Then there exists a partitioning 
ofV(F) into V(F) = L tt) S l±l R, such that \S\ < p, \L\, \R\ < k/2 and no edge of F has one endpoint 
in L and the other in R. 

Let V(F) = Ll±)5l+)i?bea partitioning of V(F) as given by Proposition and let L + = L U 

5 and R+ = RU S. For a map g : S -> V(G) and a set S' such that S C S' and a set Q we 
define hom g (F[S'], G[Q]) to be the number of injective homomorphisms from F[S'] to G[Q] coinciding 
with g on S. Similarly we let inj (F[S'],Q) to be the number of homomorphisms from F to G[Q] 
coinciding with g on S. If we guess how an injective homomorphism maps F[S] to G we get inj(-F, G) = 
^2 g injg{F, G), where the sum is taken over all injective maps g from S to V(G). For a given map g, 
we define the set of families C g = {Q C V(G) : \Q\ = \L\} and TZ g = {Q C V(G) : \Q\ = \R\}. 
The weight of a set Q € C g is defined as a g (Q) = mj g (F[L + ], G[Q U g(S)]) and the weight of a set 
Q G U g is set to af{Q) = mj g (F[R+],G[Q U g(S)}). 

For any Q\ e C g and Q2 € TZg such that Q\ n Q2 = 0, if we take an injective homomorphism 
h\ from F[L + ] to G[Q\ U g(S)] coinciding with g on S and another injective homomorphism h% from 
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F[R + ] to G[Q2 U g(S)] coinciding with g on S and glue them together, we obtain an injective homo- 
morphism hi © h% from F to G. Furthermore two homomorphisms from F to G can only be equal 
if they coincide on all vertices of F. Thus, if Q[ G £ 9 , (3 2 € 7£ g and h[ and /i' 2 are injective homo- 
morphisms from F[L + ] to G[Q[ U 5(5)] and from F[R + ] to G[Q' 2 U g(S')] respectively we have that 
h\®h,2 = h'i © h' 2 if and only if h[ = h\ and h 2 = h 2 . Also, for any injective homomorphism /i from 
F to G that coincides with g on S 1 we can decompose it into an injective homomorphism hi from 
to G[S U Qi] and another injective homomorphism /i 2 from F[R + ] to G[«S U Q2] such that Qi € C g , 
Q 2 £ 7£ s and Qi n Q 2 = 0- Then inj g (F, G) = C g M K g and hence 

mj(F,G)=J2£g®K g (5) 
g 

Proposition 4 (ll8l 1161 ). Given two families A and B together with weight functions a : A — > N arcc? 
/3 : i3 — > N we can compute the disjoint sum AMB in time 0(n(\ J, .4| + | | where n is the number 
of distinct elements covered by the members of A and B. Here [ A = {X : z\A 6 A, X C ^4}. 

We would like to use Proposition |4]together with Equation © in order to compute inj(F, G). Thus, 
given the mapping g : S — > V(G) we need to compute £ g , lZ g , a g and a g . Listing C g and 1Z g can be 
done easily in ( fc " 2 ) + ( fc ™ 2 ) time, so it remains to compute efficiently a g and aj*. 

Lemma 3. Let G be an n-vertex graph, F be a i-vertex graph of treewidth t, S C V{F) and g be a 
function from S to V(G). There is an algorithm to compute inj 9 (F, G[Q U g(S)]) for all £ — \S\ sized 

subsets Q ofV(G) \ g(S) in time O*{(Y? =t ©) 1 nP )- 

Proof. We claim that the following inclusion-exclusion formula holds for mj g (F, G[Q U g{S)]). 

mi g (F,G[QUg(S)\) = ^ (-l)l T l-l x lhom 9 (F, G[X U g{S)\) (6) 

XCQ 

To prove the conectness of Equation ©, we first show that if there is an injective homomorphism / 
from F to G[Q U g(S)] coinciding with g on S then its contribution to the sum is exactly one. Notice that 
since \S\ + \Q\ = \V(F)\, all injective homomorphisms that coincide with g on S only contribute when 
X = Q and thus are counted exactly once in the right hand side. Since we are counting homomorphisms, 
in the right hand side sum we also count maps which are not injective. Next we show that if a homomor- 
phism h from F to G[S U Q], which coincides with g on S, is not an injective homomorphism then its 
total contribution to the sum is zero, which will conclude the correctness proof of the equation. Observe 
that since h is not an injective homomorphism it misses some vertices of Q. Thus h(V(F)) n Q = W 
for some subset W C Q. We now observe that h is counted only when we are counting homomorphisms 
from F to G[X U g{S)] such that W C X. The total contribution of h in the sum, taking into account 
the signs, is 




Thus, we have shown that if h is not an injective homomorphism then its contribution to the sum is zero, 
and hence Equation (© holds. 

Observe that since \Q\ = £ — \S\, we can rewrite (— l^H^ as (_i^-|SH^I_ Define j(X) = 
(-l)H s H*lhom fl (F, G[X U g(S)}), then we can rewrite Equation © as follows: mj g (F,G[Q U 
g(S)]) = 7C(Q)- We start by pre-computing a table containing j(Q') for every Q' with \Q'\ < £ — \S\. 
To do this we need to compute hom 9 (F, G[Q' U g(S)]) for all subsets Q' of V(G) \ g(S) of size at 
most I — \S\. There are at most Yl^i^ (?) sucn subsets, and for each subset Q' we can compute 
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hom 9 (F, G[Q' U g(S)]), and hence also a(Q') using the dynamic programming algorithm of Diaz et 
al. 02 in time 0*{n p ). Now, to compute 7 C(Q) for all Q C V(G) \ g{S) of size £ - \S\ we apply 
the algorithm for the trimmed zeta transform (ALGORITHM Z) from Q. This algorithm runs in time 
0*(£'=i S| ("))■ Thus the total running time of the algorithm is then 0*((Y%=? (")) ■ vP). This 
concludes the proof. □ 

We are now in position to prove the main theorem of this section. 
Theorem 3. Let G be an n-vertex graph and F be a k-vertex graph ofpathwidth p. Then we can solve 
the #SUBGRAPH ISOMORPHISM problem in time C*(( fc / 2 ) n2p ) an d space C*(( fc / 2 ))- 

Proof. We apply Proposition 0] together with Equation §5$ in order to compute inj(F, G). In particular, 
for every mapping g : S — > V(G) we list C g and lZ g and compute a g and using the algorithm from 
Lemma[3] Finally, to compute C g M lZ g we apply Proposition HI 

The sum in Equation (f5]) runs over ( n )pl < n p different choices for g. For each g, listing C g 

and computing otg, and listing 1Z g and computing a^, takes C*((fc/ 2 -|S|) nP ) an ^ ^*((fc/2) nP ) ^ me 
respectively. Finally, computing C g M 1Z g takes time 0*((&/ 2 )) - Thus the total running time for the 
algorithm to compute inj(F, G) is O* (( k n /2 )n 2p ). 

To compute the number of occurrences of F as a subgraph in G, we use the basic fact that the number 
of occurrences of F in G is inj(F, G)/aut(F) 0. Since aut(F) = inj(F, F) we can compute aut(i ? ) 
using the algorithm for computing inj(F, G) in time C*(( fc / 2 )« 2p ) = 0*(2 h n 2p ). This concludes the 
proof of the theorem. □ 

4.2 Polynomial Space Algorithm 

In this section we give a polynomial space variant of our algorithm presented in the previous section. 
Our proof is similar to the one described by Bjorklund et al. |{8 ] for the #&;-Path problem. We will also 
need the following proposition which gives a relationship between inj(F, G) and hom(F, G). 

Proposition 5 ([5]). Let F and G be two graphs with \ V(G)\ = \V{F)\. Then 

mj(F,G) = (-l) |H/| hom(F,G[y(G)\iy]) = (_i)|v|-|W| hom (F, G[W]). 

WCV(G) WCV{G) 

Theorem 4. Let G be an n-vertex graph and F be a k-vertex graph ofpathwidth p. Then we can solve 
the #SUBGRAPH ISOMORPHISM problem in time O* (( fc " 2 )2 fc n 3p t 2 *) and polynomial space. 

Proof. By Equation © we know that inj (F, G) = £ ff £ g M1Z g . We first show how to compute C g Kl 1Z g 
for a fixed map g : S — > V(G). For brevity, we use the Iverson Bracket notation: [P] = 1 if P is true, 
and [P] = if P is false. 

C g mn g = Y, E [M^N = $]a L g {M)l3f{N) 

M&Cg N£lZg 

= E E E (-l) m [XCMnN]a^M)^(N) 

MeCg NGTlg {XCV(G),\X\<k/2} 

E (-1)' X| E E [XQM]][XczN]a^M)^(N) 
{Xcv(G),|X|<ifc/2} MeCgNeTZg 

E E E tfw) 

{A-CV(G),|A-|<fc/2} MeC g ,M^X NeTlg,NDX 

k/2 

= E E E «, L w)( E C7) 

i=l {XCy(G),(X|=i} MeC g ,MDX Ne1Z g ,NDX 
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For every M E C g , by Equation ©, we know that the following inclusion-exclusion formula holds 

for a%(M). 



a^(M) = mi g (F[L + ],G[MUg(S)]) = £ (— l)' A/ ' _ ' M ''hom g (F[L + ], G[M' U g(S)]) 



M'CM 



We can compute hom g (F[L + ], G[M' U g{S)\) in C*((nt) 2p ) time and polynomial space using a variant 
of Theorem 12 For details please see Appendix 16.51 Hence, using this we can compute oik (M) in time 
0*(2\ M \{nt) 2p ). Similarly we can compute <xf{N) in time 0*{2\ N \ (nt) 2p ) for every N G K g . Now 
using Equation (O we can bound the running time to compute C g M !Z g as follows: 



g((I)(i2i:>*( 2ll| (-) 2 ')^I)(iV->* (2lR|( " f)2p ' 

< j2 ( 2 " /2 (|2|) °* ( 2 ' L| ( nt ) 2p ) + 2fc/2 (|^|) °* ( 2,i?l ( n *) 2p )) 



k/2 

< 

i=l 



This implies that the time taken to compute inj(F, G) = Y, g £g^ K g is 0*(2 k { k n /2 )n 3p t 2t ), as the total 
number of choices for g is upper bounded by < n p . Finally, to compute the number of occurrences 
of F in G, we use the basic fact that the number of occurrences of F in G is inj (F, G) / aut(F) @ as in the 
proof of Theorem[3] We can compute aut(F) = inj(F, F), using the polynomial space algorithm given 
by Proposition for computing inj(F,G) and TheoremEl in time Yli=i (i)C*((M 2p ) = 0*{2 k k 4p ) 
and space polynomial in k. This concludes the proof of the theorem. □ 

Theorems |3] and @] can easily be generalized to handle the case when F has treewidth at most t by 
observing that if tw(F) < t then pw(i ? ) < (t + 1) log(/c — 1) [18] and that Theorem [2] works for graphs 
of bounded treewidth. 



5 Conclusion 

In this paper we considered the SUBGRAPH ISOMORPHISM problem and the #S UBGRAPH ISOMOR- 
PHISM problem and gave the best known algorithms, in terms of time and space requirements, for these 
problems when the pattern graph F is restricted to graphs of bounded treewidth or pathwidth. Count- 
ing graph homomorphisms served as a main tool for all our algorithms. We combined counting graph 
homomorphisms with various other recently developed tools in parameterized and exact algorithms like 
meet-in-middle, trimmed variant of Yates's algorithm, the DISJOINT Sum problem and algebraic circuits 
and formulas to obtain our algorithms. We conclude with an intriguing open problem about a special 
case of the Subgraph Isomorphism problem. Can we solve the Subgraph Isomorphism problem 
in time 0*(c n ), c a fixed constant, when the maximum degree of F is 3? 

Acknowledgements. We thank Mikko Koivisto for pointing us to an error in the previous version of this 
manuscript and for useful discussions. 
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6 Appendix 



6.1 Tree width, Path width and Nice Tree-Decomposition 

A tree decomposition of a (undirected) graph G is a pair (U, T) where T is a tree whose vertices we will 
call nodes and U = ({Ui \ i G F(T)}) is a collection of subsets of V(G) such that 

i- u e y ( T) c< = nco, 

2. for each edge vw G E(G), there is an i € V(T) such that v,w <E U, and 

3. for each u G V(G) the set of nodes {i \ v G L^} forms a subtree of T. 

The f/j's are called bags. The width of a tree decomposition ({C/j | i G V(T)}, T) equals maXj g y( T ){|[/j 
1}. The treewidth of a graph G is the minimum width over all tree decompositions of G. We use notation 
tw(G) to denote the treewidth of a graph G. When in the definition of the treewidth, we restrict our- 
selves to path, we get the notion of pathwidth of a graph and denote it by pw(G). We also need a notion 
of nice tree decomposition for our algorithm. A nice tree decomposition of a graph G is a tuple (U, T, r) 
, where T is a tree rooted at r and (U, T) is a tree decomposition of G with the following properties: 

1. T is a binary tree. 

2. If a node r G T has two children, say t\ and r%, and U T = U Tl = U T , 2 then it is called join node. 

3. If a node r has one child n, |I7 T | = |£/ Tl | + 1 and {7 Tl C f/ T then it is called introduce node. 

4. If a node r has one child n, |C/ n | = |Z7 T | + 1 and {J T C {7 Tl then it is called forget node. 

5. If a node r is a leaf node of T then it is called base node. 

Given a tree-decomposition of width t, one can obtain a nice tree-decomposition of width t in linear 
time. 

6.2 Proof of Proposition |2] 

Proof. Starting with the root node of $ we do a depth-first evaluation. At any instance we need to 
store the labels of nodes along a path under exploration which will require O(dlogs) space. We also 
need to store the partial evaluations along the path under exploration. Since output of a depth d formula 
with x\ = 1, . . . , x n = 1 as inputs can be stored using most d bits, we would require 0{d) bits per 
node in the path. Hence 0{d 2 ) bits are required for the whole path. Hence the total space required is 
0{d{d + \ogs)). □ 

6.3 Proof of Lemma |2] 

Proof. Given an input gate u of $ with the label (T, r, U, S, ip), we can describe the algorithm as follows: 

1. Check if U = U T and ip is indeed a homomorphism between F[U T ] and G[S] else FAIL. 

2. If t is a leaf node then set count = and return. 

3. If t is a join node then the node (T, r, U, S, t/j) has two children in <1> (see equation [B hence set 
count = 2. If i < 2, then return the label of the i th child according the canonical ordering. 

4. If r is an introduce node and t\ is the only child of r in T, then set count = 1 and if i = 1 then 
return (T, n,U Tl ,S\ {ip(u)}, ip\u Tl } e l se FAIL. (See equation[2]) 

5. If r is a forget node then set count = \Y\. If i < \Y\ then return the i th tuple (T, n, U n , 5 U 
{v}, ib v ) in the canonical ordering of \ (T, n, U T1 , S U {v}, ib v ) > . (See equation[3]) 
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Time and space analysis: The running time of the algorithm is clearly upper bounded by Size(3>) = 
0*{{nt) t ). For space, note that we can store T using O(ktlogn) bits as there are 0{k) nodes in 
T, each containing a bag of size at most t + 1. Storing a label of the form (T, r, U, S, ip) requires 
0(log/c + tlogn + ilogra + 2ilogn) = 0(logk + tlogn) many bits of space. Steps 1-4 of the 
algorithm can be implemented within this space bound. However, step 5 requires a storing of the set 

R = \ (T, tl, U T , , S U {v }, ip v ) \ and then choosing the i th element in their lexicographic ordering. 

However as \R\ = \Y\ < \V(G)\ = n, we can store all these labels in 0(n(logfc + tlogn)) bits of 
space. So, the overall space requirement is 0((n + fc)n(log k + t log n)) bits. □ 

6.4 Proof of Proposition |3] 

Proof. The vertices of a graph F of pathwidth p can be ordered as v\ . . . Vk such that for any i < k 
there is a subset Si C {v% . . . Vi} with \Si\ < p, such that there are no edges of F with one endpoint 
in {v\ . . . Vi] \ Si and the other in {vi+%, . . . vt}- Such an ordering is obtained, for example, in ifTTl . 
Choose V = {v\ . . . frfc/2i}> S = S^/ 2 -\, L = L' \ S and R = {v^k/2]+i ■■■ v k}- Then L, S and R 
have the claimed properties. □ 

6.5 Variant of Theorem |2] 

For the proof of TheoremHl we need a variant of Theorem[2] In TheoremlU we need hom g (F, G[X U S] ) 
instead of hom(F, G[X U 5]). This can be achieved if we can compute the polynomial, 

Vq(xi,. . . ,x n ) = ^2 Y[ x *(u), 

$eHOM 9 (F,G) u&V{F) 

by an arithmetic formula of size n°( p \ where HOM 9 (F, G) is the set of all homomorphisms from F 
to G that coincide with the function g : S — > V(G), for S C V(F). We obtain the desired circuit by 
modifying the constructions given in Lemma[T]as follows: 

• the sum in Equation dU) runs over HOM g (F, G[X U S]); and 

• in Equation © we ensure that the sum is restricted to rp v 's that agree with g on S C V(F). 

With an argument similar to that of Lemma [T] and using the fact that tw(i ? ) < pw(i ? ) we conclude 
as follows. 

Lemma 4. Let F and G be given two graphs with \V (F)\ = k and \V(G)\ = n. Let X, S QV(G) and 
g : V{F) — > S be any function. Then the polynomial Vq{x\, . . . , x n ) is computable by an arithmetic 
formula of size 0*((np) 2p ) where p is the pathwidth of F. 

Now, by applying Proposition [2] and Lemma [2] we get a polynomial space algorithm to compute 

hom g (F,G[XuS}). 
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